A Two-Level Morphological Analysis Of Korean

نویسندگان

  • Deok-Bong Kim
  • Sung-Jin Lee
  • Key-Sun Choi
  • Gil-Chang Kim
چکیده

ABSTH,AGT The two-level morThology model has received a grcal deal oJ attention and ha,s been implcmcnlcd for languages like li'ianish, English, JalmnCSe , Ru,ssian, l,'rcnch, and so on. However, this model has been claimed to be inapproprialc ]or Korean morphological analysis, because the complez" conjugation (inflection) and agglutination in word formation, and the syllabic-based representation oa t. worda may lead to a huge a'am-ber of two-level morphological rules, ht this paper, we show that the twoJcvcl model can be succcs,sJully applied to Korean and its rule size i~ limiled to only 52. Art czlensiou of two-level morphology is described for Korean language. 1992) is a well-known comi)u~,at, ional model of morphology , which ha~ adaptability a~ well ~u~ siml)lic-ity. In t)ractice, this mo(M ha.s been successfully al)-.level model ha~ been considered to l)c inapl~rol)riate two-level morphological analysis of Korean is believed to be diliicuit and infcasible because the complex conjugation (inItection) ainl agglutination in word formation , and the syllable based representation of words may lead to a huge mmlber of two-level morphologicM rules. In this paper, we show that the two-level model can be successfully applied to Korean and its rule size is limited to only 52. This paper presents a successful two-lcvel system [*or Korean morphological analysis. The system wa.s ba~ed on a shareware PC-KIMMO (Antworth, 1990); however, wc extended the I/O component of I'(J-KIMMO to handle Korean alphabet HANUUL; we c(m.~,ructed a Korean dictionary and a Korean morphological grammar (i.e., morphotactics and spelling rules) tot the I'G-K1MMO; wc also used a shareware KGI';N (Miles, 1!191) to translate the linguistic spelling rules into the executable automal, a (i.e., tinite state transducers (FSTs)). This paper focuses on the dictionary and the morphologicM grammar for Korcalt. TWO-LEVEL REPRESENTATION OF KOREAN WORDS The two lewd model is conceLned with directly mapping bctwcen two rcprescntations of a word: (1) tile sur]hcefo,'m (SF) ~ it appears in the text, and (2) the lexical ]orm (LF) which is represented ms a sequence of ba.~ic morphs and diacritics (c.g., '+' to mark morpheme me boundary and '~' for word boundary). As a re suit, an input word in the two-level modcl is analyzed by mapping the word itself (SF) to a sequence of le~ ical forms in dictionary without intermediate stages. In this section, we present a two-level representation of i(ore~m words. 'lb understand the two-level description for Korean ntorphology, one should be properly familiar …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-level post-processing for Korean character recognition using morphological analysis and linguistic evaluation

Most of the post-processing methods for character recognition rely on contextual information of character and word-fragment levels. However, due to linguistic characteristics of Korean, such low-level information alone is not sufficient for high-quality character-recognition applications, and we need much higher-level contextual information to improve the recognition results. This paper present...

متن کامل

Korean Morphology with Elementary Two-Level Rules and Rule Features

Although the existing models for two-level morphology have several merits, they also have some limitations. In morphologically complex languages like Korean, it is not easy to develop FSTs that encode very complex two-level rules that can be innnitely generated. And, because lexical idiosyncrasy is encoded by introducing arbitrary diacritics into the lemma of the lexicon, lexical representation...

متن کامل

Chart-driven Connectionist Categorial Parsing of Spoken Korean

While most of the speech and natural language systems which were developed for English and other Indo-European languages neglect the morphological processing and integrate speech and natural language at the word level, for the agglu-tinative languages such as Korean and Japanese, the morphological processing plays a major role in the language processing since these languages have very complex m...

متن کامل

Integrated speech and morphological processing in a connectionist continuous speech understanding for Korean

A new tightly coupled speech and natural language integration model is presented for a TDNN-based continuous possibly large vocabulary speech recognition system for Korean. Unlike popular n-best techniques developed for integrating mainly HMM-based speech recognition and natural language processing in a word level, which is obviously inadequate for morphologically complex agglutinative language...

متن کامل

A Hidden Contributor to the Korean Miracle: The Korean Credit :union: Movement

Korean credit :::union:::s (CUs) are considered to be a hidden contributor to the “Korean miracle”, characterized by remarkable economic growth and relatively low income inequality. The Korean miracle not only generated wealth in an economically strapped and socially under-privileged people, but also contributed to regional community development and the democratization of Korean society. In...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994